Method and apparatus for measurements of patterned structures

ABSTRACT

A method for measuring at least one desired parameter of a patterned structure having a plurality of features defined by a certain process of its manufacturing. The structure represents a grid having at least one cycle formed of at least two locally adjacent elements having different optical properties in respect of incident radiation. An optical model is provided, which is based on at least some of the features of the structure defined by a certain process of its manufacturing, and on the relation between a range of the wavelengths of incident radiation to be used for measurements and a pitch of the structure under measurements. The model is capable of determining theoretical data representative of photometric intensities of light components of different wavelengths specularly reflected from the structure and of calculating said at least one desired parameter of the structure. A measurement area, which is a grid cycles containing area and is substantially larger than a surface area of the structure defined by one grid cycle, is located and spectrophotometric measurements are applied to the measurement area, by illuminating it with incident radiation of a preset substantially wide wavelength range. A light component substantially specularly reflected from the measurement area is detected, and measured data representative of photometric intensities of each wavelength within the wavelength range is obtained. The measured and theoretical data are analyzed and the optical model is optimized until the theoretical data satisfies a predetermined condition. Upon detecting that the predetermined condition is satisfied, said at least one parameter of the structure is calculated.

This is a continuation-in-part of application Ser. No. 09/267,989, filed Mar. 12, 1999, U.S. Pat. No. 6,100,985 which is a continuation-in-part of Ser. No. 09/092,378, filed Jun. 5, 1998 now abandoned.

FIELD OF THE INVENTION

This invention is in the field of measurement techniques and relates to a method and a system for measuring the parameters of patterned structures.

BACKGROUND OF THE INVENTION

Techniques for thickness measurements of patterned structures have been developed. The term “patterned structure” used herein, signifies a structure formed with regions having different optical properties with respect to an incident radiation. More particularly, a patterned structure represents a grid having one or more cycles, each cycle being formed of at least two different locally adjacent stacks. Each stack is comprised of layers having different optical properties.

Production of integrated circuits on semiconductor wafers requires maintaining tight control over the dimensions of small structures. Certain measuring techniques enable the local dimensions of a wafer to be measured with relatively high resolution, but at the expense of discontinued use of the wafer in production. For example, inspection using a scanning electron microscope gives measurements of the parameters of a patterned structure, but at the expense of cleaving it and thus excluding it from continued processing. Mass production of patterned structures such as wafers requires a non-destructive process for controlling thin film parameters in a manner enabling the local measurements to be performed.

One kind of the conventional techniques for measuring thickness of thin films is disclosed in U.S. Pat. No. 4,999,014. The technique is based on the use of small spot size and large numerical aperture for measurements on small areas. Unfortunately, in the case of a very small structure, this approach suffers from a common drawback associated, on the one hand, with the use of a small spot-size and, on the other hand, owing to the large numerical aperture, with the collection of high diffraction orders. The term “small spot-size” signifies the spot diameter similar in size to the line or space width of the measured structure, i.e. a single grid cycle. This leads to various problems, which are difficult to solve. Indeed, not all the stacks' layers are in the focus of an optical system used for collecting reflected light, the optical system being bulky and complicated. Detected signals are sensitive to small details of a grid profile and to small deviations in the spot placement. Diffraction effects, which depend significantly on the grid profile and topography and therefore are difficult to model, have to be included in calculations.

Another example of the conventional techniques of the kind specified is disclosed in U.S. Pat. No. 5,361,137 and relates to a method and an apparatus for measuring the submicron linewidths of a patterned structure. The measurements are performed on a so-called “test pattern” in the form of a diffraction grating, which is placed in a test area of the wafer. Here, as in most conventional systems, a monochromatic incident light is employed and diffraction patterns are produced and analyzed. However, a large number of test areas are used and also information on multiple parameters cannot be obtained.

According to some conventional techniques, for example that disclosed in U.S. Pat. No. 5,087,121, portions with and without trenches are separately illuminated with broadband light, the reflection spectrum is measured and corresponding results are compared to each other with the result being the height or depth of a structure. However, it is often the case that the structure under inspection is such that the different portions cannot be separately imaged. This is owing to an unavoidable limitation associated with the diameter of a beam of incident radiation striking the structure.

The above approach utilizes frequency filtering to enable separation of interference signals from different layers. This is not feasible for layers of small thickness and small thickness difference because of a limited number of reflection oscillations.

Yet another example of the conventional technique for implementing depth measurements is disclosed in U.S. Pat. No. 5,702,956. The method is based on the use of a test site that represents a patterned structure similar to that of the wafer (circuit site), but taken in an enlarged scale. The test site is in the form of a plurality of test areas each located in the space between two locally adjacent circuit areas. The test areas are designed so as to be large enough to have a trench depth measured by an in-line measuring tool. The measurements are performed by comparing the parameters of different test areas assuming that the process is independent of feature size. For many processes in the field such as etching and photoresist development, this assumption is incorrect and this method is therefore inapplicable.

SUMMARY OF THE INVENTION

It is a major object of the present invention to overcome the above listed and other disadvantages of the conventional techniques and provide a novel method and system for non-destructive, non-contact measurements of the parameters of patterned structures.

It is a further object of the invention to provide such a method and system that enables a relatively small amount of information representative of the structure's conditions to be obtained and successfully processed for carrying out the measurements, even of very complicated structures.

According to one aspect of the present invention, there is provided a method for measuring at least one desired parameter of a patterned structure which represents a grid having at least one cycle formed of at least two locally adjacent elements having different optical properties in respect of incident radiation, the structure having a plurality of features defined by a certain process of its manufacturing, the method comprising the steps of:

a) providing an optical model, which is based on at least some of said features of the structure and on relation between wavelength range of the incident radiation to be used for measurements and pitch of the structure under measurements, and is capable of determining theoretical data representative of photometric intensities of light components of different wavelengths specularly reflected from the structure and of calculating said at least one desired parameter of the structure;

b) locating a measurement area for applying thereto spectrophotometric measurements, wherein said measurement area is a grid cycles containing area and is substantially larger than a surface area of the structure defined by one grid cycle;

c) applying the spectrophotometric measurements to said measurement area by illuminating it with incident radiation of a preset substantially wide wavelength range, detecting light component substantially specularly reflected from the measurement area, and obtaining measured data representative of photometric intensities of each wavelength within said wavelength range;

d) analyzing the measured data and the theoretical data and optimizing said optical model until said theoretical data satisfies a predetermined condition; and

e) upon detecting that the predetermined condition is satisfied, calculating said at least one parameter of the structure.

Thus, the main idea of the present invention consists of the following. A patterned structure, whose parameters are to be measured, is manufactured by several sequential steps of a certain technological process completed prior to the measurements. Actual design-rule features can often be found in the structure in sets (e.g. read lines in memories). The term “design-rule features” signifies a predetermined set of the allowed pattern dimensions used throughout the wafer. Hence, information regarding the desired parameters can be obtained using super-micron tools such as a large spot focused on a set of lines.

The present invention, as distinct from the conventional approach, utilizes a spectrophotometer that receives reflected light substantially from zero-order. The zero-order signal is not sensitive to small details of the grid profile of the structure such as edge rounding or local slopes. This enables the effects associated with diffracted light not to be considered, and thereby the optical model, as well as the optical system, to be simplified. Moreover, the large spot-size enables large depth of focus that includes the whole depth of the structure to be measured. When the spot includes a number of grid cycles, then the measurement is insensitive to local defects, exact spot placement or focusing.

In the case of wafers, each such element in the grid cycle consists of a stack of different layers. The features of such a structure (wafer), which are dictated by the manufacturing process and should be considered by the optical model, may be representative of the following known effects:

specular reflection from the different stacks within the grid cycle;

interference of reflected light from layers within each stack;

dissipation within transparent stacks due to cavity-like geometry formed in the grid-like structure;

specular contributions due to width of stacks relative to the wavelength;

polarization due to the incident beam interaction with a conductive grid-like structure, if present;

effects due to limited coherence of illumination;

interference between light beams reflected from each stack within the grid cycle, taking into account the above effects.

The contribution of each of the above effects to the theoretical data is estimated in accordance with the known physical laws.

The optical model, being based on some of the features, actually requires certain optical model factors to be considered in order to perform precise calculations of the desired parameters. If information of all the features is not available and the model cannot be optimized prior to the measurements, this is done by means of a so-called initial “learning” step. More specifically, there are some optical model factors which, on the one hand, depend variably on all the features and, on the other hand, define the contribution of each of the existing optical effects into the detected signal. The values of these optical model factors are adjusted along with the unknown desired parameters during the learning step so as to satisfy the predetermined condition. The latter is typically in the form of a merit function defining a so-called “goodness of fit” between the measured and theoretical data. The resulting optical model factors can consequently be used in conjunction with known features to enable precise calculations of the desired parameters of the structure.

Preferably, the measurement area is the part of the structure to be measured. Alternatively, the measurement area is located on a test pattern representative of the actual structure to be measured, namely having the same design rules and layer stacks. The need for such a test pattern may be caused by one of the following two reasons:

1) If the measurement area is not substantially smaller than the available surface area defined by the actual structure to be measured, then the test site is implemented so as to include an extended structure;

2) If the structure is very complicated or consists of ambiguous under-layer structure, then the test site is implemented with the same geometry as that of the actual structure to be measured, but with a simplified under-layer design, thus allowing simplified measurements of the top layers.

According to another aspect of the present invention, there is provided an apparatus for measuring at least one desired parameter of a patterned structure that represents a grid having at least one grid cycle formed of at least two locally adjacent elements having different optical properties in respect of an incident radiation, the structure having a plurality of features defined by a certain process of its manufacturing, the apparatus comprising:

a spectrophotometer illuminating a measurement area by an incident radiation of a preset substantially wide wavelength range and detecting a specular reflection light component of light reflected from the measurement area for providing measured data representative of photometric intensities of detected light within said wavelength range, wherein the measurement area is substantially larger than a surface area of the structure defined by the grid cycle; and

a processor unit coupled to the spectrophotometer, the processor unit comprising a pattern recognition software and a translation means so as to be responsive to said measured data and locate measurements, the processor being operable for

applying an optical model, based on at least some of said features of the structure and on relation between wavelength range of the incident radiation to be used for measurements and pitch of the structure under measurements, for providing theoretical data representative of photometric intensities of light specularly reflected from the structure within said wavelength range and calculating said at least one desired parameter, and

comparing said measured and theoretical data and detecting whether the theoretical data satisfies a predetermined condition.

Preferably, the spectrophotometer is provided with an aperture stop accommodated in the optical path of the specular reflected light component. The diameter of the aperture stop is set automatically according to the grid cycle of the measured structure.

Preferably, the incident radiation and the reflected light received by the detector are directed along substantially specular reflection axes.

More particularly, the invention is concerned with measuring height/depth and width dimensions on semiconductor wafers and is therefore described below with respect to this application.

BRIEF DESCRIPTION OF THE DRAWINGS

In order to understand the invention and to see how it may be carried out in practice, a preferred embodiment will now be described, by way of non-limiting example only, with reference to the accompanying drawings, in which:

FIGS. 1a and 1b are, respectively, schematic cross-sectional and top views of one kind of a patterned structure to be measured;

FIG. 2 schematically illustrates the main components of an apparatus according to the invention for measuring the parameters of a patterned structure;

FIG. 3 is a graphical illustration of the main principles of the present invention, showing the relationship between measured and theoretical data obtained by the apparatus of FIG. 2;

FIG. 4 illustrates yet another example of a patterned structure to be measured with the apparatus of FIG. 2;

FIGS. 5a and 5b illustrate a flow diagram of the main steps of a method according to the invention;

FIGS. 6 to 10 are schematic cross-sectional views of five more examples of patterned structures suitable to be inspected by the apparatus of FIG. 2;

FIGS. 11 and 12 illustrate two examples, respectively, of patterned structures to be measured, wherein one-dimensional periodicity in layers is considered;

FIGS. 13A-13C illustrate a two-dimensional structure to be measured, relating to DRAM applications, wherein FIG. 13A is a top view of the structure, and FIGS. 13B and 13C are cross-sectional views taken along lines A—A and B—B, respectively, in FIG. 13A.

DETAILED DESCRIPTION OF A PREFERRED EMBODIMENT

Referring to FIGS. 1a and 1b, there are partly illustrated a cross-section and a top view, respectively, of a grid-like wafer structure, generally designated 10, whose parameters are to be measured. The structure is formed of a plurality of cells, generally at 12, each constituting a grid cycle. Only three adjacent cells 12 are demonstrated in the present example with only two stacks (or elements) in each cell in order to simplify the illustration. Thus, the cell 12 comprises two stacks 12a and 12b formed of different layers. More specifically, the stack 12a includes six layers L₁-L₆, wherein the layers L₁ and L₂ and the layer L₆ form two layers L₁ and L_(2,6), respectively, of the stack 12b. As known in the conventional semiconductor devices, semiconductor structures such as sources, drains and gate electrodes, capacitors, etc. are formed in and on a semiconductor substrate (layer L₁) typically made of silicon material and including metal conductors (e.g. aluminum). The substrate is coated by an insulating silicon oxide compound (layer L₂). The first level metal layer L₄ (and the single level in the present example) is formed, being interposed between top and bottom barrier layers L₃ and L₅ made of titanium nitride (TiN). Deposition coating of an uppermost insulating silicon oxide layer L₆ and subsequent chemical mechanical polishing (CMP), consisting of thinning the uppermost layer L₆, completes the manufacturing. The construction of such a structure and method of its manufacturing are known per se and therefore need not be more specifically described.

According to this specific example, the parameters to be measured are the widths W₁ and W₂ of the stacks 12a and 12b and depths d₁ and d₂ of the uppermost silicon oxide layers L₆ and L_(2,6), respectively. It is appreciated that any other parameters of the patterned structure such as, for example, materials and their optical properties, can be measured.

Reference is now made to FIG. 2 illustrating a system, generally designated 14, suitable for carrying out the measurements. The system 14 may represent one of the working stations of a production line (not shown), the wafers 10 progressing between upstream and downstream stations of the production line. The system 14 comprises a support frame 16 for holding the structure 10 within an inspection plane, a spectrophotometer 18 and a processor unit 20 connected thereto. The spectrophotometer 18 typically includes a light source 22 for generating a beam of light 24 of a predetermined wavelength range, light directing optics 26 and a detector unit 28. The light directing optics 26 are typically in the form of a beam deflector comprising an objective lens 30, a beam splitter 32 and a mirror 34. The detector unit 28 typically comprises an imaging lens 36, a variable aperture stop 38 coupled to and operated by its motor 40 and a spectrophotometric detector 42. The construction and operation of the spectrophotometer 18 may be of any known kind, for example, such as disclosed in U.S. Pat. No. 5,517,312 assigned to the assignee of the present application. Therefore, the spectrophotometer 18 need not be more specifically described, except to note the following.

The light beam 24 passes through the light directing optics 26 and impinges onto the structure 10 at a certain location defining a measurement area S₁. Light component 44 specularly reflected from the reflective regions within the area S₁ is directed onto the detector unit 28.

It should be noted that, generally, the illuminated location of the structure may be larger than the measurement area S₁, in which case suitable optics are provided for capturing, in a conventional manner, light reflected solely from the part (area S₁) within the illuminated location. In other words, the measurement area being of interest is included into a spot-size provided by the light beam 24 when impinging onto the structure 10. In order to facilitate understanding, assume that the illuminated area defined by the diameter of the incident beam constitutes the measurement area S₁.

The light directing optics 26 and detector unit 28 are designed such that only a zero-order light component of light reflected from the structure 10 is sensed by the spectrophotometric detector 42. The construction is such that the incident and detected light beams are directed substantially parallel to each other and substantially perpendicular to the surface of the structure 10. The diameter of the aperture stop 38 is variable and is set automatically according to the grid cycle of the measured structure. Generally speaking, the diameter of the aperture stop is optimized to collect the maximum reflected intensity excluding diffraction orders.
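By way of non-limiting illustration only, the dependence of the aperture setting on the grid cycle can be sketched with the standard grating equation for normal incidence, sin θ_m = m·λ/Λ, where Λ is the pitch. The function names below, and the criterion of comparing sin θ₁ with the collection numerical aperture, are assumptions introduced for this sketch and are not the procedure by which the aperture stop 38 is actually set.

```python
def first_order_sine(wavelength_um, pitch_um):
    """sin(theta_1) at normal incidence from the grating equation
    sin(theta_m) = m * wavelength / pitch, with m = 1.
    A value above 1 means the first order is evanescent."""
    return wavelength_um / pitch_um


def first_order_excluded(wavelength_um, pitch_um, collection_na):
    """True if the first diffraction order falls outside the collection cone.
    Assumed criterion: sin(theta_1) > numerical aperture of the collection path."""
    return first_order_sine(wavelength_um, pitch_um) > collection_na


def na_upper_bound(min_wavelength_um, pitch_um):
    """Upper bound on the collection NA (hypothetical criterion) below which the
    first order is rejected at the shortest wavelength of the range; capped at 1."""
    return min(1.0, first_order_sine(min_wavelength_um, pitch_um))


# Illustrative numbers only: 1 um pitch, 0.5 um light, collection NA of 0.2
print(first_order_excluded(0.5, 1.0, 0.2))
print(na_upper_bound(0.4, 2.0))
```

The shortest wavelength of the range gives the smallest first-order angle, which is why the bound is evaluated there.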

Additionally, the diameter of the incident beam 24, defining the measurement area S₁, is substantially larger than the surface area S₀ defined by the cell 12, that is:

S₁ > S₀

According to this specific example, the patterned structure 10 is a so-called “one-dimensional” structure. As clearly seen in FIG. 1b, the stacks 12a and 12b are aligned along the X-axis, while along the Y-axis the stacks continue to infinity (uniform structure) with respect to the measurement area S₁. In other words, the measurement area S₁ includes a structure that has one or more grid cycles extending along the X-axis and is uniform along the Y-axis.

The whole surface area S of the structure under inspection should be substantially larger than the measurement area S₁ defined by the diameter of the incident beam.

S > S₁

The case may be such that the above conditions are not available in the structure 10. For example, the structure may contain a single grid cycle. To this end, the measurement area S₁ consisting of more than one cell 12 should be located on a test-site (not shown).

For example, if the system 14 provides the numerical aperture of 0.2 and spot-diameter (measurement area S₁) about 15 μm, the minimum surface area S of a test-site should be 20 μm. NovaScan 210 spectrophotometer, commercially available from Nova Measuring Instruments Ltd., Israel, may be used in the system 14.

The spectrophotometer 18 measures the photometric intensities of different wavelengths contained in the detected, zero-order light component of the reflected beam 44. This is graphically illustrated in FIG. 3, being shown as a dashed curve D_(m) constituting the measured data. The processor unit 20 comprises a pattern recognition software and a translation means so as to be responsive to the measured data and locate measurements. It is pre-programmed with a certain optical model based on at least some features of the structure for calculating theoretically the photometric intensities of light of different wavelengths reflected from a patterned structure. This is shown in FIG. 3 as a solid curve D_(t) constituting the theoretical data.

In order to design the optical model capable of estimating all the possible optical effects, which are dictated by the features of the structure to be measured and affect the resulting data, the following should be considered.

Generally, total specular reflection R from the grid-like structure is formed of a coherent part R_(coh) and an incoherent part R_(incoh). It is known that coherence effects play an essential role in the measurements when a wide bandwidth radiation is used. The coherence length L of light in the optical system is determined by the radiation source and by the optical system (spectrophotometer) itself. Reflection amplitudes from structure's features smaller than the coherence length interact coherently, producing thereby interference effects between light reflected by different stacks of the cell. For larger features, a non-negligible portion of light reflected by different stacks undergoes incoherent interaction without producing interference. The coherence length L defines a mutual coherence v of light coming from points separated by half a cycle of the grid structure and, consequently, defines the degree of coherence γ, that is:

L = D·λ

$v = \frac{2 \cdot \pi \cdot \left( W_{1} + W_{2} \right)}{2 \cdot L}$

$\gamma = \left( \frac{2 \cdot J_{1}(v)}{v} \right)^{2}$

wherein D is a variable parameter determined experimentally for the actual optical system and stack structure based on the measured reflection spectra (measured data) for grids of varied cycle dimensions; J₁ is the Bessel function of the first kind of order one. An approximate initial input for the determination of the parameter D may be given by nominal optical system characteristics. Hence, the total specular reflection R is given by:

R = γ·R_(coh) + (1−γ)·R_(incoh)
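The coherence relations above can be sketched numerically as follows. This is a minimal sketch only: the function names are hypothetical, the numerical values of W₁, W₂, λ and D are made up for illustration, and J₁ is evaluated here from Bessel's integral with a trapezoid rule merely to keep the example free of external libraries.

```python
import math


def bessel_j1(x, steps=400):
    """J1(x) from Bessel's integral (1/pi) * int_0^pi cos(t - x*sin(t)) dt,
    evaluated with the trapezoid rule (adequate for moderate |x|)."""
    h = math.pi / steps
    total = 0.5 * (math.cos(0.0) + math.cos(math.pi - x * math.sin(math.pi)))
    for k in range(1, steps):
        t = k * h
        total += math.cos(t - x * math.sin(t))
    return total * h / math.pi


def degree_of_coherence(w1, w2, wavelength, d_param):
    """gamma = (2*J1(v)/v)**2 with L = D*lambda and v = 2*pi*(W1 + W2)/(2*L)."""
    coherence_length = d_param * wavelength
    v = 2.0 * math.pi * (w1 + w2) / (2.0 * coherence_length)
    if abs(v) < 1e-8:
        return 1.0  # limit of (2*J1(v)/v)**2 as v -> 0
    return (2.0 * bessel_j1(v) / v) ** 2


def mix_reflection(r_coh, r_incoh, gamma):
    """R = gamma*R_coh + (1 - gamma)*R_incoh."""
    return gamma * r_coh + (1.0 - gamma) * r_incoh


# Illustrative values only (lengths in micrometers, D dimensionless)
gamma = degree_of_coherence(w1=0.5, w2=0.5, wavelength=0.6, d_param=10.0)
print(gamma, mix_reflection(0.30, 0.25, gamma))
```

For a grid cycle much smaller than the coherence length, v tends to zero and γ tends to one, so that the coherent part dominates.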

In order to estimate the possible optical effects affecting the above parts of the total reflected signal, the following main factors should be considered, being exemplified with respect to the patterned structure 10 (FIGS. 1a and 1b):

1) Filling factors a₁ and a₂:

$a_{1} = \frac{W_{1}}{W_{1} + W_{2}}$

$a_{2} = \frac{W_{2}}{W_{1} + W_{2}}$

These factors represent the zero-order contribution, which is based only on the ratio of the areas of stacks 12a and 12b, respectively, in the reflection calculation. The zero-order signal is not sensitive to small details of the grid profile of the structure 10 such as edge rounding or local slopes. Therefore, the effects associated with diffracted light need not be considered.

2) Size coupling factors c₁ and c₂:

When the width of the stack is close to the wavelength, the filling factors a₁ and a₂ should be corrected for reducing the coupling of the incident radiation to the respective stack. To this end, so-called “coupling factors” c₁ and c₂ should be introduced to the filling factors a₁ and a₂, respectively. The coupling factor gives a negligible effect when the width of the stack is large relative to the wavelength and negates the interaction completely when the stack width is much smaller than the wavelength. Using a heuristic exponential function to give this dependence, the coupling factors are as follows:

$c_{1} = \exp \left\{ - A \cdot \exp \left( \frac{\lambda}{W_{1}} \right) \right\}$

$c_{2} = \exp \left\{ - A \cdot \exp \left( \frac{\lambda}{W_{2}} \right) \right\}$

wherein λ is the wavelength of a respective light component; A is a variable factor depending on the dimensions and materials of the structure and is determined experimentally for the actual stack structure, as will be described further below.
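The limiting behavior of the coupling factor can be checked with a short sketch. The function name and the value chosen for A are hypothetical; the clamp on the inner exponent is added here only to avoid floating-point overflow and is not part of the model.

```python
import math


def coupling_factor(wavelength, width, a_param):
    """c = exp(-A * exp(lambda / W)).
    The inner exponent is clamped (an implementation detail of this sketch)
    so that very narrow stacks do not overflow the float range."""
    ratio = min(wavelength / width, 700.0)
    return math.exp(-a_param * math.exp(ratio))


# Illustrative values only: 0.5 um light, A = 0.05, stacks from wide to narrow
for width in (5.0, 1.0, 0.5, 0.1):
    print(width, coupling_factor(wavelength=0.5, width=width, a_param=0.05))
```

With the formula as written, the wide-stack limit of the coupling factor is exp(−A) rather than exactly one, so the correction is negligible for wide stacks only when the fitted value of A is small.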

3) Dissipation b₂ in cavity-like structures:

It is often the case that one of the stacks is essentially dissipative owing to geometrical effects reducing reflection, which effects typically take place in cavity-like structures. Among these geometrical effects are high aspect-ratio trenches and wave-guiding underneath metal grid-like structures. High aspect-ratio structures are characterized by a dissipative effect that decreases the amount of light reflected back out with phase impact. For example, multiple reflections in deep grooves in metal both reduce the amount of light reflected back out and destroy the phase relation. The above effects are relatively strong for deep geometry and relatively weak for shallow structures (relative to the wavelength). Using a heuristic exponential function to give this dependence, a dissipation factor b₂ is given by:

$b_{2} = \exp \left\{ - B \cdot \frac{d_{2}}{\lambda} \right\}$

wherein B is a variable size parameter, which is determined experimentally for the actual stack structure; d₂ is the depth of the cavity-like part of the stack. Here, by way of example only, the stack 12b is defined as a dissipative one.

In order to model the corrected filling factors, it is assumed that light radiation not reflected from a certain cell's stack from coupling considerations is essentially reflected by other cell's stack(s). The dissipation factor b₂ is taken into account in the reduced effective filling factor of the geometrically dissipative area. Hence, the corrected filling factors are as follows:

A₁ = a₁·c₁ + a₂·(1−c₂)

A₂ = (a₂·c₂ + a₁·(1−c₁))·b₂

4) Polarization factors, representing the contribution of polarization effects that may take place in the case of metallic grids:

When the width of a cell's stack is close to the wavelength, a corrective factor should be introduced for reducing the coupling of the incident TE radiation to the respective stack owing to boundary conditions at the edges of metal lines. The polarization factor gives a negligible effect when the width of the stack is large relative to the wavelength and negates the reflection completely when the stack width is much smaller than the wavelength. Hence, the polarization factors p₁ and p₂ are given by:

$p_{1} = \exp \left\{ - C \cdot \frac{\lambda}{W_{1}} \right\}$

$p_{2} = \exp \left\{ - C \cdot \frac{\lambda}{W_{2}} \right\}$

wherein C is a variable parameter determined experimentally for the actual stack structure. It is appreciated that in the absence of a pattern formed of metal lines, the optical factor C is equal to zero.

Similarly, in order to model the corrected filling factors, it is assumed that light radiation not reflected from a certain cell's stack from polarization considerations is essentially reflected by other cell's stack(s). Hence, the corrected filling factors are as follows:

A₁ = a₁·c₁·p₁ + a₂·(1−c₂·p₂)

A₂ = (a₂·c₂·p₂ + a₁·(1−c₁·p₁))·b₂
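By way of illustration only, the filling, coupling, polarization and dissipation factors of the two-stack cell can be combined into the corrected filling factors as sketched below. The function and argument names are hypothetical and the numerical values are made up; setting C to zero (no metal lines) reduces the result to the corrected filling factors obtained from coupling and dissipation alone.

```python
import math


def dissipation_factor(depth, wavelength, b_param):
    """b2 = exp(-B * d2 / lambda)."""
    return math.exp(-b_param * depth / wavelength)


def corrected_filling_factors(w1, w2, d2, wavelength, a_param, b_param, c_param):
    """Return (A1, A2) for a two-stack cell in which stack 2 is the dissipative one."""
    # Filling factors: area ratio of each stack within the grid cycle
    a1 = w1 / (w1 + w2)
    a2 = w2 / (w1 + w2)
    # Coupling factors c = exp(-A * exp(lambda / W)); exponent clamped against overflow
    c1 = math.exp(-a_param * math.exp(min(wavelength / w1, 700.0)))
    c2 = math.exp(-a_param * math.exp(min(wavelength / w2, 700.0)))
    # Polarization factors p = exp(-C * lambda / W); C = 0 when there are no metal lines
    p1 = math.exp(-c_param * wavelength / w1)
    p2 = math.exp(-c_param * wavelength / w2)
    b2 = dissipation_factor(d2, wavelength, b_param)
    # Light not coupled into one stack is assumed to be reflected by the other stack
    big_a1 = a1 * c1 * p1 + a2 * (1.0 - c2 * p2)
    big_a2 = (a2 * c2 * p2 + a1 * (1.0 - c1 * p1)) * b2
    return big_a1, big_a2


# Illustrative values only (lengths in micrometers)
print(corrected_filling_factors(0.6, 0.4, 0.8, 0.5, a_param=0.02, b_param=0.5, c_param=0.1))
```

Before the dissipation factor is applied, the two corrected factors sum to one, which reflects the stated assumption that light rejected by one stack is reflected by the other.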

The intensity of a reflected signal r(λ) from each stack is calculated using layer thickness information and material optical parameters (constituting known features). To this end, standard equations for reflection from multi-layered stacks are used, based on Fresnel coefficients for reflection and transmission at interfaces as a function of wavelength for perpendicular incidence. The thickness for each layer is either known (being provided by the user) or calculated internally by the program. The materials of the layers and, therefore, their optical parameters, such as refraction indices and absorption, are known or calculated.
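A minimal sketch of such a calculation at perpendicular incidence is given below. It uses the standard characteristic-matrix formulation for multi-layered stacks, which is equivalent to combining the Fresnel coefficients at the interfaces; it is not asserted to be the program referred to above. The function name is hypothetical, the layer data are illustrative values rather than those of the structure 10, and absorbing materials are assumed to be entered with the index convention n − ik.

```python
import cmath
import math


def stack_reflection_amplitude(n_ambient, layers, n_substrate, wavelength):
    """Complex reflection amplitude r of a layer stack at normal incidence.

    layers: list of (refractive_index, thickness), ordered from the ambient side
    down to the substrate. Absorbing media use the convention n - 1j*k.
    Thickness and wavelength share the same length unit.
    """
    m11, m12, m21, m22 = 1.0, 0.0, 0.0, 1.0  # identity matrix
    for n, d in layers:
        delta = 2.0 * math.pi * n * d / wavelength  # phase thickness of the layer
        c, s = cmath.cos(delta), cmath.sin(delta)
        l11, l12, l21, l22 = c, 1j * s / n, 1j * n * s, c
        # Accumulate the product M = M * L (top layer first)
        m11, m12, m21, m22 = (
            m11 * l11 + m12 * l21,
            m11 * l12 + m12 * l22,
            m21 * l11 + m22 * l21,
            m21 * l12 + m22 * l22,
        )
    b = m11 + m12 * n_substrate
    c_ = m21 + m22 * n_substrate
    return (n_ambient * b - c_) / (n_ambient * b + c_)


# Illustrative example: a 0.5 um oxide-like film (n = 1.46) on a substrate with n = 3.9 - 0.02j
r = stack_reflection_amplitude(1.0, [(1.46, 0.5)], 3.9 - 0.02j, wavelength=0.6)
print(abs(r) ** 2)  # reflected intensity |r|^2
```

With an empty layer list the function reduces to the single-interface Fresnel coefficient (n₀ − n_s)/(n₀ + n_s).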

In view of the above and considering that both the coherent and incoherent parts contain contributions from two polarizations (e.g. R_(coh)=R^(p)+R^(s)), the total reflection R_(TOT) constituting the theoretical data obtained by the optical model, is given by:

$R_{TOT} = \left\{ \left| r_{1} \cdot A_{1P} + r_{2} \cdot A_{2P} \right|^{2} + \left| r_{1} \cdot A_{1S} + r_{2} \cdot A_{2S} \right|^{2} \right\} \cdot \frac{\gamma}{2} + \left\{ \left| r_{1} \right|^{2} \cdot A_{1P}^{2} + \left| r_{2} \right|^{2} \cdot A_{2P}^{2} + \left| r_{1} \right|^{2} \cdot A_{1S}^{2} + \left| r_{2} \right|^{2} \cdot A_{2S}^{2} \right\} \cdot \frac{1 - \gamma}{2}$

wherein r₁ and r₂ are the amplitudes of reflection from first and second stacks, respectively, of the cell, that is stacks 12a and 12b in the present example.
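The expression for R_(TOT) can be evaluated as sketched below. The function name is hypothetical, and the reading of A_(1P), A_(2P), A_(1S) and A_(2S) as the corrected filling factors of the two stacks for the two polarizations is an assumption of this sketch; the numerical values are illustrative only.

```python
def total_reflection(r1, r2, a1p, a2p, a1s, a2s, gamma):
    """R_TOT for a two-stack cell.

    r1, r2: complex reflection amplitudes of the two stacks.
    a1p, a2p, a1s, a2s: corrected filling factors per polarization (assumed meaning).
    gamma: degree of coherence, between 0 and 1.
    """
    # Coherent part: amplitudes are summed before squaring, so the stacks interfere
    coherent = abs(r1 * a1p + r2 * a2p) ** 2 + abs(r1 * a1s + r2 * a2s) ** 2
    # Incoherent part: intensities are summed, so there is no interference term
    incoherent = (
        abs(r1) ** 2 * a1p ** 2
        + abs(r2) ** 2 * a2p ** 2
        + abs(r1) ** 2 * a1s ** 2
        + abs(r2) ** 2 * a2s ** 2
    )
    return coherent * gamma / 2.0 + incoherent * (1.0 - gamma) / 2.0


# Illustrative values only: two stacks reflecting in anti-phase, fully coherent light
print(total_reflection(0.5, -0.5, 0.5, 0.5, 0.5, 0.5, gamma=1.0))
```

In the fully coherent case two equally weighted stacks reflecting in anti-phase cancel, whereas in the fully incoherent case their intensities simply add.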

Other effects known in common practice (such as lateral reflection, roughness, etc.) have been found to have a negligible contribution under the defined conditions and are accounted for by the adjustment of the parameters A, B, C and D.

Turning back to FIG. 3, it is clearly illustrated that the curves D_(m) and D_(t) do not coincide, that is, the theoretical data does not exactly match the measured data. A suitable merit function is used for determining the goodness of fit of the obtained results. By setting the values of the optical model parameters A, B, C and D, the optical model is defined. By fitting the values of the desired parameters, e.g. W₁, W₂, d₁ and d₂, the theoretical data is optimized until the goodness of fit reaches a certain desired value (constituting a required condition). Upon detecting that the optimized theoretical data satisfies the required condition, the desired parameters of the structure, i.e. W₁, W₂, d₁ and d₂, are calculated from the above equations.

It should be noted that in the most general case, when the grid cycle comprises two or more locally adjacent different elements (e.g., stacks), the above optical model is still correct. The mutual coherence v′ is as follows: $v^{\prime} = {\frac{\pi}{L}{\sum\limits_{i = 1}^{n}W_{i}}}$

wherein i is the i-th element (stack) in the grid cycle; n is the total number of elements within the grid cycle, and L is the coherence length. For the main factors on which the above optical model is based, we have:

Filling factor $a_{i} = \frac{W_{i}}{\sum\limits_{i = 1}^{n}W_{i}}$

Coupling factor $c_{i} = {\exp \left( {{{- A} \cdot \exp}\frac{\lambda}{W_{i}}} \right)}$

Dissipation factor $b_{m} = {\exp \left\{ {{- B_{m}}\frac{d_{m}}{\lambda}} \right\}}$

wherein m is the number of a dissipative element of the n stacks; d_(m) is the depth of the cavity-like part of the stack in relation to the neighboring stacks. For a non-dissipative stack, b_(n)=1, wherein n≠m.

Polarization factor $p_{i} = {\exp \left\{ {{- C} \cdot \frac{\lambda}{W_{i}}} \right\}}$

Corrected filling factor $A_{i} = {b_{i} \cdot \left\lbrack {{a_{i} \cdot c_{i} \cdot p_{i}} + {\sum\limits_{j = {1{({j \neq i})}}}^{n}{a_{j}\left( {1 - {c_{j} \cdot p_{j}}} \right)}}} \right\rbrack}$
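The generalized factors listed above can be sketched numerically as follows. This is only an illustrative sketch: the function names are hypothetical, and the coupling factors c_i are passed in already evaluated rather than recomputed here.

```python
import math


def filling_factors(widths):
    """a_i = W_i / sum(W_i) for the stacks of one grid cycle."""
    total = sum(widths)
    return [w / total for w in widths]


def polarization_factors(widths, wavelength, C):
    """p_i = exp(-C * lambda / W_i); C = 0 gives p_i = 1 (no metal lines)."""
    return [math.exp(-C * wavelength / w) for w in widths]


def dissipation_factor(B_m, d_m, wavelength):
    """b_m = exp(-B_m * d_m / lambda) for a dissipative (cavity-like) stack."""
    return math.exp(-B_m * d_m / wavelength)


def corrected_filling_factors(a, c, p, b):
    """A_i = b_i * [a_i*c_i*p_i + sum over j != i of a_j*(1 - c_j*p_j)].

    a, c, p, b are equal-length lists; b_i = 1 for non-dissipative stacks.
    """
    rejected = [a_j * (1.0 - c_j * p_j) for a_j, c_j, p_j in zip(a, c, p)]
    total_rejected = sum(rejected)
    return [
        b[i] * (a[i] * c[i] * p[i] + total_rejected - rejected[i])
        for i in range(len(a))
    ]
```

For two stacks this reduces to the expressions for A₁ and A₂ given earlier, with b₁=1.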

In view of the above, the total reflection R′_(TOT) is as follows: $R_{TOT}^{\prime} = {{\left\{ {\left| {\sum\limits_{i}\left( {r_{i} \cdot A_{ip}} \right)} \right|^{2} + \left| {\sum\limits_{i}\left( {r_{i} \cdot A_{is}} \right)} \right|^{2}} \right\} \cdot \frac{\gamma}{2}} + {\left\{ {{\sum\limits_{i}\left( {\left| r_{i} \right|^{2} \cdot A_{ip}^{2}} \right)} + {\sum\limits_{i}\left( {\left| r_{i} \right|^{2} \cdot A_{is}^{2}} \right)}} \right\} \cdot \frac{1 - \gamma}{2}}}$
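A minimal sketch of how the total reflection may be evaluated for n stacks is given below. It assumes that the r_i are complex amplitudes, so that the coherent part uses the squared modulus of the amplitude sums and the incoherent part uses the sum of the squared moduli; the function name and argument layout are hypothetical.

```python
def total_reflection(r, A_p, A_s, gamma):
    """Mix coherent and incoherent contributions for both polarizations.

    r     : list of (possibly complex) reflection amplitudes r_i
    A_p   : corrected filling factors A_ip for the p polarization
    A_s   : corrected filling factors A_is for the s polarization
    gamma : degree of coherence, between 0 and 1
    """
    coherent = (
        abs(sum(ri * a for ri, a in zip(r, A_p))) ** 2
        + abs(sum(ri * a for ri, a in zip(r, A_s))) ** 2
    )
    incoherent = (
        sum(abs(ri) ** 2 * a ** 2 for ri, a in zip(r, A_p))
        + sum(abs(ri) ** 2 * a ** 2 for ri, a in zip(r, A_s))
    )
    return coherent * gamma / 2.0 + incoherent * (1.0 - gamma) / 2.0
```

With two stacks of opposite amplitude and equal weight, the fully coherent case (γ=1) cancels to zero, while the fully incoherent case (γ=0) does not.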

Referring to FIG. 4, there is illustrated a part of a so-called “two-dimensional” structure 100, i.e. a structure periodical in both X- and Y-axes. This structure 100 is characterized by a plurality of grid cycles aligned along both the X- and Y-axes. The cycle aligned along the X-axis is formed of a pair of elements W₁ and W₂ (the stacks' widths), and the cycle aligned along the Y-axis is formed of a pair of elements G₁ and G₂ (the stacks' lengths). For example, the elements G₁ and G₂ may be, respectively, a metal layer stack and a block of Inter Layer Dielectric (ILD) stack. The measurement area S₁ defined by the diameter of the incident beam includes at least one cycle in X-direction and at least one cycle in Y-direction (several cycles in the present example).

Generally speaking, the cycle in either X- or Y-axis may be composed of several elements (e.g., stacks). If the measurement area S₁ is smaller than the surface area defined by the grid cycle along one of the axes X or Y, the total reflection (theoretical data) is determined in the manner described above with respect to the one-dimensional structure 10 (FIGS. 1a and 1 b). If the measurement area is larger than the surface area defined by the grid cycle along both X- and Y-axes, then for the total reflection R_(2D) of such a 2D-structure we have: $R_{2D} = {{\frac{G_{1}}{G_{1} + G_{2}} \cdot R_{G_{1}}} + {\frac{G_{2}}{G_{1} + G_{2}} \cdot R_{G_{2}}}}$

wherein R_(G1) and R_(G2) are the intensities of reflection signals from the two one-dimensional structures aligned along the Y-axis and having the widths G₁ and G₂, respectively. It should be noted that the Y-axis is no more than a notation, i.e. has no physical significance, and can be exchanged with the X-axis. For the general case of k elements in the cycle aligned along the Y-axis, we have: $R_{2D} = \frac{\sum\limits_{i = 1}^{k}{G_{i} \cdot R_{G_{i}}}}{\sum\limits_{i = 1}^{k}G_{i}}$

wherein R_(Gi) and G_(i) are the reflection intensity from and width of the i-th element.
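The width-weighted average for the two-dimensional structure can be illustrated with a short sketch. The function name is hypothetical, and the reflection intensities R_(Gi) are assumed to have been computed beforehand with the one-dimensional model.

```python
def reflection_2d(G, R_G):
    """R_2D = sum(G_i * R_Gi) / sum(G_i) over the k elements of the Y cycle."""
    return sum(g * r for g, r in zip(G, R_G)) / sum(G)
```

For example, two elements of widths 1 and 3 with intensities 0.2 and 0.6 give (0.2 + 1.8)/4 = 0.5.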

In general, the axis location for calculating the reflection intensities R_(Gi) is chosen so as to satisfy the following: G₁ + G₂ > W₁ + W₂

If the above condition is not satisfied, then the two axes are exchanged accordingly.

The main principles of a method according to the invention will now be described with reference to FIGS. 5a and 5 b. The structure of the required measurement area is examined (step 46) so as to determine whether the above measurement area condition is satisfied within the existing pattern (step 48). If this condition is not satisfied, a test-site structure satisfying the condition is designed on the reticle (step 50), the test-site being typically provided within a so-called “margin region”.

Then, an initial learning mode of operation is performed, generally at step 52. The learning mode is aimed, on the one hand, at providing the measured data and, on the other hand, at optimizing the optical model. During the learning mode, the system 14 operates in the manner described above for detecting light reflected from the illuminated area substantially at zero-order and obtaining the measured data in the form of photometric intensities of each wavelength within the wavelength range of the incident radiation (step 54). Concurrently, the processor 20 applies the above optical model for obtaining the theoretical data (step 56) and compares it to the measured data (step 58). The optical model is based on some known features of the structure and nominal values of unknown features (i.e. of the desired parameters to be measured) which are provided by the user. At this stage, the relation between the theoretical data and the measured data is compared to a certain condition (step 62). If the condition is satisfied, then correct values of the parameters A, B, C and D are calculated (step 64) and an optimized optical model is obtained (step 66). If the condition is not satisfied, then the optical model factors A, B, C and D and the unknown features are adjusted (step 60) until the condition is satisfied. It should be noted, although not specifically illustrated, that at this initial learning stage, the desired parameters can be calculated.

Thereafter the measurement mode of operation is performed, generally at step 68. To this end, the measured and theoretical data are concurrently produced (steps 70 and 72, respectively). It is appreciated that the theoretical data now produced is based on the known parameters of the structure, previously calculated correct values of the optical factors A, B, C and D and on the nominal values of the desired parameters to be measured. Similarly, the optimized theoretical data is compared to the measured data so as to determine whether or not the theoretical data satisfies a required condition (step 74), e.g. the goodness of fit is of a desired value. If so, the desired parameters are calculated (step 76) and if not, the desired parameters are adjusted (step 78) until the theoretical data substantially matches the measured data. If desired, the measurement mode (step 68) is then repeated for inspecting a further location on the structure 10 (step 80).
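The compare-and-adjust loop used in both the learning and measurement modes can be sketched as below. The text does not specify the merit function or the search strategy, so the mean squared difference and the exhaustive search over a candidate list are assumptions made purely for illustration; `model`, `candidates` and `tolerance` are hypothetical names.

```python
def merit(measured, theoretical):
    """Assumed merit function: mean squared difference over the spectrum."""
    return sum((m - t) ** 2 for m, t in zip(measured, theoretical)) / len(measured)


def fit(model, wavelengths, measured, candidates, tolerance):
    """Try each candidate parameter set and keep the best-fitting one.

    model(wavelength, params) returns the theoretical intensity.
    Returns (best_params, best_merit, condition_satisfied).
    """
    best_params, best_merit = None, float("inf")
    for params in candidates:
        theoretical = [model(lam, params) for lam in wavelengths]
        value = merit(measured, theoretical)
        if value < best_merit:
            best_params, best_merit = params, value
    return best_params, best_merit, best_merit <= tolerance
```

In the learning mode the candidates would vary the factors A, B, C and D together with the unknown features; in the measurement mode the factors stay fixed and only the desired parameters vary.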

Referring to FIGS. 6 and 7, there are illustrated in a self-explanatory manner two examples of patterned structures, designated 110 and 210, respectively, which can be inspected in the above described manner by the system 14. Each of the structures 110 and 210 consists of cells 112 and 212, respectively, each cell including two stacks formed of different layers. The parameters to be measured in these structures are, respectively, the width of a photoresist layer on top of the aluminum and the depth of the etched area (Air) within the silicon oxide layer.

Referring to FIGS. 8 and 9, there are illustrated two more examples of patterned structures, designated 310 and 410, respectively, whose parameters can be measured in accordance with the invention. Here, the parameters to be measured are, respectively, the width and thickness of an aluminum layer on top of the silicon oxide and the remaining thickness of the metallic layer on the silicon oxide layer undergoing chemical mechanical polishing.

It is appreciated that polarization effects are present in the structures 310 and 410 due to the existence of patterned metal in both structures, while being weak in the structures 110 and 210.

FIG. 10 illustrates a patterned structure 510 utilizing copper under and between any two SiO₂-based layers known as Interlayer Dielectric (ILD) insulating layer. CMP process applied to such a copper-based structure 510 typically results in copper loss portions, generally at P_(i), a so-called “dishing” effect. This effect is associated with the properties of copper (e.g., softer nature as compared to other metals) and the chemical nature of the copper-based CMP process. The parameters to be measured are the depths d₁ and d₂ of, respectively, the uppermost ILD insulating layer and the dishing-associated portion P_(i). In certain cases, depending on the layer stacks, the metal thickness can be determined by (d₁-d₂).

It has been found by the inventors that the measurements can be further optimized by taking into account the relation between the wavelength λ of incident light and the pitch size of the patterned structure (i.e., Λ=W₁+W₂ in the above examples), when selecting an optical model to be used. The above-described optical model is the optimal one for the case when Λ>λ.

Reference is now made to FIG. 11 showing a patterned structure 610 formed on a substrate layer L₁. The structure 610 is composed of a plurality of cells (grid cycles) 612, two such cells being completely shown in the figure, and can for example be formed of straight lines of metal in dielectric matrix. In this specific example, each grid cycle 612 comprises three stacks 612 a, 612 b and 612 c. Each stack is formed of K different layers having different dielectric permittivity ε. More specifically, the stack 612 a includes K layers with the dielectric permittivities ε₁(1), ε₁(2), . . . , ε₁(K), respectively; the stack 612 b includes K layers with the dielectric permittivities ε₂(1), ε₂(2), . . . , ε₂(K), respectively; and the stack 612 c includes K layers with the dielectric permittivities ε₃(1), ε₃(2), . . . , ε₃(K), respectively. An ambient, “superstrate” layer L₀ is considered as the upper layer of the structure 610.

Thus, the entire structure to be measured is formed of j layers including the 0-th superstrate layer L₀, K stack layers, and the lower substrate layer L₁, that is, j=K+2. Index j=0 corresponds to the superstrate layer L₀, and index j=K+1 corresponds to the substrate layer L₁.

In the present example, the normal incidence of light onto the patterned structure 612 (e.g., straight lines of metal in dielectric matrix) is considered. Here, the Z-axis is perpendicular to the surface of the structure (i.e., parallel to the direction of propagation of incident light towards the structure), the X-axis is perpendicular to the lines of metal (i.e., elements of the pattern), and the Y-axis is parallel to the metal lines.

In the most general case, the structure 612 has N stacks (n=1, . . . , N), each with K layers, the width of the n-th stack being ΔX_(n). The structure pitch Λ_(X) is thereby determined as follows: $\Lambda_{X} = {\sum\limits_{n = 1}^{N}{\Delta \quad X_{n}}}$

The interaction of the incident light with each layer can be described using the effective permittivity tensor: $ɛ = \begin{pmatrix} ɛ_{X} & 0 \\ 0 & ɛ_{Y} \end{pmatrix}$

The above tensor describes the case of relatively small values of pitch Λ_(X) (the period of grating along the X-axis), as compared to the wavelength of incident light λ, i.e., Λ_(X)/λ<1. The tensor components ε_(X) and ε_(Y) correspond to the electric field vector parallel to the X-axis and Y-axis, respectively (i.e., perpendicular (TM) and parallel (TE) to the metal lines, respectively).

Keeping in mind that the structure under measurements is composed of a plurality of layers, the components of the effective permittivity tensor, ε_(X)(j) and ε_(Y)(j), in j-th layer have the form: $\left\lbrack {ɛ_{X}(j)} \right\rbrack^{- 1} = {\sum\limits_{n = 1}^{N}{\left\lbrack {ɛ_{n}(j)} \right\rbrack^{- 1}\frac{\Delta \quad X_{n}}{\Lambda_{X}}}}$ ${ɛ_{Y}(j)} = {\sum\limits_{n = 1}^{N}{{ɛ_{n}(j)}\frac{\Delta \quad X_{n}}{\Lambda_{X}}}}$

wherein ε_(n)(j) is the permittivity of j-th layer in n-th stack; ΔX_(n) is the width of the n-th stack; N is the number of stacks.
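The two averages for one layer can be illustrated by the following sketch, in which the function name is hypothetical and the permittivities may be real or complex numbers. The X component is the width-weighted harmonic mean and the Y component is the width-weighted arithmetic mean.

```python
def effective_permittivity(eps_stacks, widths):
    """Return (eps_X, eps_Y) for one layer of a small-pitch grating.

    eps_stacks : permittivity of this layer in each of the N stacks
    widths     : stack widths Delta X_n; their sum is the pitch Lambda_X
    """
    pitch = sum(widths)
    inv_eps_x = sum((w / pitch) / e for e, w in zip(eps_stacks, widths))
    eps_y = sum(e * (w / pitch) for e, w in zip(eps_stacks, widths))
    return 1.0 / inv_eps_x, eps_y
```

For two stacks of equal width with permittivities 2 and 4, this gives ε_X = 8/3 and ε_Y = 3, so the layer is optically anisotropic even though each material is isotropic.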

The total reflectivity of the structure R_(TOT) has the form:

R _(TOT) =ψ|R _(X)(0)|²+(1−ψ)|R _(Y)(0)|²

wherein ψ describes the polarization of light: ψ=0 for light polarized along Y-axis, ψ=1 for light polarized along the X-axis, ψ=0.5 for unpolarized light; R_(X)(0) and R_(Y)(0) are reflectivity amplitudes of the entire structure along the X- and Y-axis, respectively, which are functions of the effective permittivity.

The reflectivity amplitudes R_(X)(j) and R_(Y)(j) can be determined using the following recurrent expressions: ${R_{X}(j)} = \frac{{r_{X}(j)} + {{R_{X}\left( {j + 1} \right)}{\exp \left\lbrack {{- 2}\quad i\quad {\sigma_{X}\left( {j + 1} \right)}} \right\rbrack}}}{1 + {{r_{X}(j)}{R_{X}\left( {j + 1} \right)}{\exp \left\lbrack {{- 2}\quad i\quad {\sigma_{X}\left( {j + 1} \right)}} \right\rbrack}}}$ for  j = K, K − 1, …  , 1, 0. ${R_{Y}(j)} = \frac{{r_{Y}(j)} + {{R_{Y}\left( {j + 1} \right)}{\exp \left\lbrack {{- 2}\quad i\quad {\sigma_{Y}\left( {j + 1} \right)}} \right\rbrack}}}{1 + {{r_{Y}(j)}{R_{Y}\left( {j + 1} \right)}{\exp \left\lbrack {{- 2}\quad i\quad {\sigma_{Y}\left( {j + 1} \right)}} \right\rbrack}}}$ for  j = K, K − 1, …  , 1, 0.

wherein r_(X)(j) and r_(Y)(j) are reflectivity amplitudes of each of the j layers, σ_(X)(j) and σ_(Y)(j) are complex coefficients showing both the attenuation and phase shift of the TM or TE light within the j-th layer, and are determined as follows: ${r_{X}(j)} = \frac{\sqrt{ɛ_{X}(j)} - \sqrt{ɛ_{X}\left( {j + 1} \right)}}{\sqrt{ɛ_{X}(j)} + \sqrt{ɛ_{X}\left( {j + 1} \right)}}$ ${r_{Y}(j)} = \frac{\sqrt{ɛ_{Y}(j)} - \sqrt{ɛ_{Y}\left( {j + 1} \right)}}{\sqrt{ɛ_{Y}(j)} + \sqrt{ɛ_{Y}\left( {j + 1} \right)}}$ ${\sigma_{X}(j)} = {\frac{2\pi}{\lambda}{d(j)}\sqrt{ɛ_{X}(j)}}$ ${\sigma_{Y}(j)} = {\frac{2\pi}{\lambda}{d(j)}\sqrt{ɛ_{Y}(j)}}$

In the above equations, d(j) is the thickness (depth) of the j-th layer.

The reflectivity amplitudes r_(X)(j) and r_(Y)(j) do not take into account the interference of waves reflected from different layers. They describe the reflectivity from the interface between the j-th and (j+1)-th substances only. In other words, they correspond to the reflectivity from the interface of two semi-infinite volumes with the permittivities ε_(X)(j) and ε_(X)(j+1) in the case of TM polarization, and with the permittivities ε_(Y)(j) and ε_(Y)(j+1) for TE polarization.

On the other hand, the reflectivity amplitudes R_(X)(j) and R_(Y)(j) describe the reflectivity from the (j+1)-th layer, taking into account the interference of the waves reflected from the interfaces between the different layers. Thus, R_(X)(0) and R_(Y)(0) correspond to the reflectivity from the upper layer L₀ of the measured structure for TM and TE polarizations, respectively.

As indicated above, the complex coefficients σ_(X)(j) and σ_(Y)(j) show both attenuation and phase shift of the TM or TE light within the j-th layer: the real part of σ describes the phase shift, and the imaginary part of σ describes the attenuation coefficient.

For the reflectivity amplitudes, the complex coefficients, and the permittivity of the 0-th and (K+1)-th layers, we have: $\begin{matrix} {{R_{X}\left( {K + 1} \right)} = 0} & {{R_{Y}\left( {K + 1} \right)} = 0} \\ {{\sigma_{X}\left( {K + 1} \right)} = 0} & {{\sigma_{Y}\left( {K + 1} \right)} = 0} \\ {{ɛ_{X}(0)} = ɛ_{L0}} & {{ɛ_{Y}(0)} = ɛ_{L0}} \\ {{ɛ_{X}\left( {K + 1} \right)} = ɛ_{L1}} & {{ɛ_{Y}\left( {K + 1} \right)} = ɛ_{L1}} \end{matrix}$
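The recurrence and its boundary conditions can be sketched for one polarization as follows. This is an illustrative sketch only: the function names are hypothetical, the interface amplitude is taken as the standard normal-incidence Fresnel form (difference of the square roots over their sum), and the sign of the imaginary part of an absorbing permittivity is assumed to be chosen so that exp(−2iσ) decays.

```python
import cmath
import math


def reflectivity_amplitude(eps, thickness, wavelength):
    """Return R(0) for layers j = 0 (superstrate) .. K+1 (substrate).

    eps       : effective permittivities eps(j) for one polarization
    thickness : d(j) in the same units as wavelength; the entries for
                j = 0 and j = K+1 are ignored
    """
    substrate = len(eps) - 1  # index K+1
    R = 0j                    # boundary condition R(K+1) = 0
    for j in range(substrate - 1, -1, -1):  # j = K, K-1, ..., 1, 0
        n_j = cmath.sqrt(eps[j])
        n_next = cmath.sqrt(eps[j + 1])
        r = (n_j - n_next) / (n_j + n_next)  # single-interface amplitude r(j)
        if j + 1 == substrate:
            sigma = 0.0                      # boundary condition sigma(K+1) = 0
        else:
            sigma = 2.0 * math.pi / wavelength * thickness[j + 1] * n_next
        phase = cmath.exp(-2j * sigma)
        R = (r + R * phase) / (1.0 + r * R * phase)
    return R


def total_reflectivity(R_x, R_y, psi=0.5):
    """R_TOT = psi*|R_X(0)|^2 + (1 - psi)*|R_Y(0)|^2; psi = 0.5 is unpolarized."""
    return psi * abs(R_x) ** 2 + (1.0 - psi) * abs(R_y) ** 2
```

As a sanity check, a single lossless layer whose refractive index is the geometric mean of its neighbours and whose optical thickness is a quarter wavelength yields zero reflectivity, as expected for an ideal antireflection coating.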

Let us generalize this approach to take into account one more possible case of so-called “middle pitches” with respect to the wavelength of incident light, i.e., Λ˜λ. In this case, the reflectivity amplitudes are functions of the effective permittivity of each n-th stack in j-th layer. The effective permittivity tensor has the form: ${ɛ\left( {j,n} \right)} = \begin{pmatrix} {ɛ_{X}\left( {j,n} \right)} & 0 \\ 0 & {ɛ_{Y}\left( {j,n} \right)} \end{pmatrix}$

wherein ε_(X)(j, n) and ε_(Y)(j, n) are as follows:

ε_(X)(j,n)=ε_(X)(j)+α(λ)[ε_(n)(j)−ε_(X)(j)]

ε_(Y)(j,n)=ε_(Y)(j)+α(λ)[ε_(n)(j)−ε_(Y)(j)]

Here, α(λ) is the coefficient, which is the monotonically decreasing function of wavelength of incident light λ, and is indicative of the effect of “mixing” of the two limiting cases Λ>λ and Λ<λ; α=0 for the case of Λ<λ, α=1 for the case Λ>λ, and 0<α<1 for the case of Λ˜λ.
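The mixing rule is a linear interpolation between the layer-averaged permittivity and the permittivity of the individual stack. A minimal sketch, with a hypothetical function name, is:

```python
def mixed_permittivity(eps_avg, eps_stack, alpha):
    """eps(j, n) = eps(j) + alpha * [eps_n(j) - eps(j)].

    alpha = 0 returns the layer average (small-pitch limit);
    alpha = 1 returns the stack's own permittivity (large-pitch limit).
    """
    return eps_avg + alpha * (eps_stack - eps_avg)
```

The same function serves for both the X and Y components, since only the averaged value passed in differs.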

Using the analogous formulas for each stack (rather than each layer), the total reflectivity R_(TOT) can be expressed as follows: $R_{TOT} = {{\psi \left| {\sum\limits_{n = 1}^{N}{{R_{X}\left( {0,n} \right)}\frac{\Delta \quad X_{n}}{\Lambda_{X}}}} \right|^{2}} + {\left( {1 - \psi} \right)\left| {\sum\limits_{n = 1}^{N}{{R_{Y}\left( {0,n} \right)}\frac{\Delta \quad X_{n}}{\Lambda_{X}}}} \right|^{2}}}$

wherein N is the number of stacks; R_(X)(j,n) and R_(Y)(j,n) are the reflectivity amplitudes for n-th stack, and can be obtained using the previous recurrent equations, but utilizing the values of ε_(X)(j,n) and ε_(Y)(j,n) instead of values of ε_(X)(j) and ε_(Y)(j), respectively.
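Once the per-stack amplitudes R_(X)(0,n) and R_(Y)(0,n) are available, the total reflectivity is the squared modulus of their width-weighted sums. The sketch below assumes the amplitudes are supplied by the caller; the function name is hypothetical.

```python
def total_reflectivity_per_stack(R_x, R_y, widths, psi=0.5):
    """Combine per-stack amplitudes into R_TOT for the middle-pitch case.

    R_x, R_y : complex amplitudes R_X(0, n) and R_Y(0, n), one per stack
    widths   : stack widths Delta X_n; their sum is the pitch Lambda_X
    psi      : 0 for Y-polarized, 1 for X-polarized, 0.5 for unpolarized light
    """
    pitch = sum(widths)
    sum_x = sum(r * w / pitch for r, w in zip(R_x, widths))
    sum_y = sum(r * w / pitch for r, w in zip(R_y, widths))
    return psi * abs(sum_x) ** 2 + (1.0 - psi) * abs(sum_y) ** 2
```

Because the amplitudes are summed before squaring, stacks with opposite phase can cancel each other.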

The reflectivity amplitudes R_(X)(j,n) and R_(Y)(j,n) can be obtained using the following recurrent expressions: ${R_{X}\left( {j,n} \right)} = \frac{{r_{X}\left( {j,n} \right)} + {{R_{X}\left( {{j + 1},n} \right)}{\exp \left\lbrack {{- 2}i\quad {\sigma_{X}\left( {{j + 1},n} \right)}} \right\rbrack}}}{1 + {{r_{X}\left( {j,n} \right)}{R_{X}\left( {{j + 1},n} \right)}{\exp \left\lbrack {{- 2}i\quad {\sigma_{X}\left( {{j + 1},n} \right)}} \right\rbrack}}}$ for  j = K, K − 1, …  , 1, 0. ${R_{Y}\left( {j,n} \right)} = \frac{{r_{Y}\left( {j,n} \right)} + {{R_{Y}\left( {{j + 1},n} \right)}{\exp \left\lbrack {{- 2}i\quad {\sigma_{Y}\left( {{j + 1},n} \right)}} \right\rbrack}}}{1 + {{r_{Y}\left( {j,n} \right)}{R_{Y}\left( {{j + 1},n} \right)}{\exp \left\lbrack {{- 2}i\quad {\sigma_{Y}\left( {{j + 1},n} \right)}} \right\rbrack}}}$ for  j = K, K − 1, …  , 1, 0. ${r_{X}\left( {j,n} \right)} = \frac{\sqrt{ɛ_{X}\left( {j,n} \right)} - \sqrt{ɛ_{X}\left( {{j + 1},n} \right)}}{\sqrt{ɛ_{X}\left( {j,n} \right)} + \sqrt{ɛ_{X}\left( {{j + 1},n} \right)}}$ ${r_{Y}\left( {j,n} \right)} = \frac{\sqrt{ɛ_{Y}\left( {j,n} \right)} - \sqrt{ɛ_{Y}\left( {{j + 1},n} \right)}}{\sqrt{ɛ_{Y}\left( {j,n} \right)} + \sqrt{ɛ_{Y}\left( {{j + 1},n} \right)}}$ ${\sigma_{X}\left( {j,n} \right)} = {\frac{2\pi}{\lambda}{d(j)}\sqrt{ɛ_{X}\left( {j,n} \right)}}$ ${\sigma_{Y}\left( {j,n} \right)} = {\frac{2\pi}{\lambda}{d(j)}\sqrt{ɛ_{Y}\left( {j,n} \right)}}$

As indicated above, the reflectivity amplitudes r_(X)(j,n) and r_(Y)(j,n) do not take into account the interference of the waves reflected from different layers. They describe the reflectivity from the interface between the j-th and (j+1)-th substances only. In other words, they correspond to the reflectivity from the interface of two semi-infinite volumes with the permittivities ε_(X)(j, n) and ε_(X)(j+1, n) in the case of TM polarization, and with the permittivities ε_(Y)(j, n) and ε_(Y)(j+1, n) in the case of TE polarization. The reflectivity amplitudes R_(X)(j,n) and R_(Y)(j,n) describe the reflectivity from the (j+1)-th layer within the n-th stack, taking into account the interference of the waves reflected from interfaces between the different layers. Thus, R_(X)(0,n) and R_(Y)(0,n) correspond to the reflectivity from the n-th stack in the upper layer of the measured structure for TM and TE polarizations, respectively.

The complex coefficients σ_(X)(j,n) and σ_(Y)(j,n) show both the attenuation and phase shift of the TM or TE light within the n-th stack of the j-th layer: the real part of σ describes the phase shift, and the imaginary part of σ describes the attenuation coefficient.

Index j=0 corresponds to the superstrate layer L₀, and index j=K+1 corresponds to the substrate layer L₁. Thus, we have: $\begin{matrix} {{R_{X}\left( {{K + 1},n} \right)} = 0} & {{R_{Y}\left( {{K + 1},n} \right)} = 0} \\ {{\sigma_{X}\left( {{K + 1},n} \right)} = 0} & {{\sigma_{Y}\left( {{K + 1},n} \right)} = 0} \\ {{ɛ_{X}\left( {0,n} \right)} = ɛ_{L0}} & {{ɛ_{Y}\left( {0,n} \right)} = ɛ_{L0}} \\ {{ɛ_{X}\left( {{K + 1},n} \right)} = ɛ_{L1}} & {{ɛ_{Y}\left( {{K + 1},n} \right)} = ɛ_{L1}} \end{matrix}$

Referring to FIG. 12, there is illustrated a patterned structure 710 composed of grid cycles 712 (two such cycles being shown in the figure), which are formed on a substrate layer L₁ and are covered by a superstrate air layer L₀. In this specific example, the Cu-polishing during the dual Damascene process is considered. The grid cycle 712 consists of two stacks 712 a and 712 b of equal widths ΔX₁ and ΔX₂, respectively, such that ΔX₁=ΔX₂=Λ/2, Λ being the pitch.

In this case, the number K of stack layers is 3, and the total number of layers including the substrate and superstrate layers is 5, i.e., j=0,1,2,3,4. The condition of j=4 corresponds to the substrate layer L₁ (Si). The condition of j=3 corresponds to the lower oxide layer (SiO₂) with the thickness of about 5000 Å. The condition of j=2 corresponds to an etch stop layer (Si₃N₄) with the thickness of about 1000 Å. The condition of j=1 corresponds to the metal-containing layer (Cu in the first stack, and SiO₂ in the second stack) with the thickness of about 5000 Å. The condition of j=0 corresponds to the superstrate layer L₀ (water/air layer).

In this case, we have the trivial results for the layers 0, 2, 3, and 4, as follows: $\begin{matrix} {{ɛ_{X}\left( {4,1} \right)} = {{ɛ_{X}\left( {4,2} \right)} = {ɛ({Si})}}} & {{ɛ_{Y}\left( {4,1} \right)} = {{ɛ_{Y}\left( {4,2} \right)} = {ɛ({Si})}}} \\ {{ɛ_{X}\left( {3,1} \right)} = {{ɛ_{X}\left( {3,2} \right)} = {ɛ\left( {SiO}_{2} \right)}}} & {{ɛ_{Y}\left( {3,1} \right)} = {{ɛ_{Y}\left( {3,2} \right)} = {ɛ\left( {SiO}_{2} \right)}}} \\ {{ɛ_{X}\left( {2,1} \right)} = {{ɛ_{X}\left( {2,2} \right)} = {ɛ\left( {{Si}_{3}N_{4}} \right)}}} & {{ɛ_{Y}\left( {2,1} \right)} = {{ɛ_{Y}\left( {2,2} \right)} = {ɛ\left( {{Si}_{3}N_{4}} \right)}}} \\ {{ɛ_{X}\left( {0,1} \right)} = {{ɛ_{X}\left( {0,2} \right)} = {{ɛ({Air})} = 1}}} & {{ɛ_{Y}\left( {0,1} \right)} = {{ɛ_{Y}\left( {0,2} \right)} = {{ɛ({Air})} = 1}}} \end{matrix}$

For the metal-containing layer (j=1) with periodic structure (grating) we have the following results: ${ɛ_{X}(1)} = \frac{1}{{0.5/ɛ({Cu})} + {0.5/ɛ\left( {SiO}_{2} \right)}}$ (average permittivity for TM polarization) ${ɛ_{X}\left( {1,1} \right)} = {{ɛ_{X}(1)} + {{\alpha (\lambda)}\left\lbrack {{ɛ({Cu})} - {ɛ_{X}(1)}} \right\rbrack}}$ ${ɛ_{X}\left( {1,2} \right)} = {{ɛ_{X}(1)} + {{\alpha (\lambda)}\left\lbrack {{ɛ\left( {SiO}_{2} \right)} - {ɛ_{X}(1)}} \right\rbrack}}$ ${ɛ_{Y}(1)} = {{0.5ɛ({Cu})} + {0.5ɛ\left( {SiO}_{2} \right)}}$ (average permittivity for TE polarization) ${ɛ_{Y}\left( {1,1} \right)} = {{ɛ_{Y}(1)} + {{\alpha (\lambda)}\left\lbrack {{ɛ({Cu})} - {ɛ_{Y}(1)}} \right\rbrack}}$ ${ɛ_{Y}\left( {1,2} \right)} = {{ɛ_{Y}(1)} + {{\alpha (\lambda)}\left\lbrack {{ɛ\left( {SiO}_{2} \right)} - {ɛ_{Y}(1)}} \right\rbrack}}$

Here, α(λ) is the coefficient indicative of the “mixing” of two cases: Λ>λ and Λ<λ. The value α(λ) depends on the ratio between the pitch and the wavelength. Let us present the coefficient α(λ) as a linear function of wavelength λ (in nm):

α(λ)=α₅₀₀+(α₉₀₀−α₅₀₀)(λ−500)/(900−500)

If α(λ)<0, then α(λ)=0. If α(λ)>1, then α(λ)=1. Here, α₅₀₀ and α₉₀₀ are the values of the coefficient α for the wavelength λ equal to 500 nm and 900 nm, respectively. For measured wavelength ranging between 500 nm and 900 nm, the approximate values of α₅₀₀ and α₉₀₀ for different values of pitch may be as follows:

For Λ<0.20 μm, α₅₀₀=0.0 and α₉₀₀=0.0 (Λ<λ);

For Λ=0.32 μm, α₅₀₀≈0.1 and α₉₀₀≈0.2;

For Λ=0.70 μm, α₅₀₀≈0.5 and α₉₀₀≈0.7;

For Λ=1.50 μm, α₅₀₀≈0.7 and α₉₀₀≈0.9;

For Λ>4.00 μm, α₅₀₀=1.0 and α₉₀₀=1.0 (Λ>λ).

The above values are presented as non-limiting examples only. It should be understood that the values of α₅₀₀ and α₉₀₀ are considered as fitting parameters, because they depend not only on the ratio Λ/λ, but on the geometry of the structure as well (metal density, optical constants, substances, exact stack structure, roughness of the interface between the different stacks, etc.). These parameters α₅₀₀ and α₉₀₀ should be optimized once per each structure, and after fixing these optimized values, they should be maintained constant during the measurements.
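The linear form of α(λ) with clamping to the interval from 0 to 1 can be sketched as follows. The function name is hypothetical, and the values of α₅₀₀ and α₉₀₀ used in the usage note are the approximate, non-limiting figures quoted above.

```python
def mixing_coefficient(wavelength_nm, alpha_500, alpha_900):
    """alpha(lambda) interpolated linearly between 500 nm and 900 nm.

    Values below 0 are set to 0 and values above 1 are set to 1.
    """
    value = alpha_500 + (alpha_900 - alpha_500) * (wavelength_nm - 500.0) / (900.0 - 500.0)
    return min(1.0, max(0.0, value))
```

For instance, with the approximate values given for Λ=0.70 μm (α₅₀₀≈0.5, α₉₀₀≈0.7), the coefficient at 700 nm is about 0.6.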

In the previous examples of FIGS. 11 and 12, one-dimensional periodicity in the layers was considered. To measure the oxide thickness in DRAM applications, however, the spectra from layers with two-dimensional structure within the layers have to be analyzed. To this end, the previous one-dimensional consideration can be easily generalized to the case of the alternative (perpendicular) lines (metal lines) in different layers, i.e. the two-dimensional stack.

FIGS. 13A-13C illustrate a structure 810 of the typical DRAM application. FIG. 13A is a top view of the structure. FIGS. 13B and 13C are cross-sectional views taken along lines A—A and B—B, respectively.

Lines L⁽¹⁾ correspond to the STI (Shallow Trench Isolation), i.e., lines of SiO₂ within a lower level L₁ of Si substrate, and lines L⁽²⁾ correspond to the DRAM gate stack in the middle layer L₂ (SiN/Oxide/WSi/Poly/Gate Oxide stack). An upper layer L₃ of oxide is covered by an ambient air/water layer L₀.

In this case, the average permittivities ε_(X)(j) and ε_(Y)(j) of each layer for electric field vector being parallel to the X-axis and Y-axis, respectively, can be calculated. If the lines in the j-th layer are parallel to Y-axis, we have: $\left\lbrack {ɛ_{X}(j)} \right\rbrack^{- 1} = {\sum\limits_{n = 1}^{N}{\sum\limits_{m = 1}^{M}{\left\lbrack {ɛ_{n\quad m}(j)} \right\rbrack^{- 1}\frac{\Delta \quad X_{n}\Delta \quad Y_{m}}{\Lambda_{X}\Lambda_{Y}}}}}$ ${ɛ_{Y}(j)} = {\sum\limits_{n = 1}^{N}{\sum\limits_{m = 1}^{M}{{ɛ_{n\quad m}(j)}\frac{\Delta \quad X_{n}\Delta \quad Y_{m}}{\Lambda_{X}\Lambda_{Y}}}}}$

If the lines in the j-th layer are parallel to the X-axis, we have: ${ɛ_{X}(j)} = {\sum\limits_{n = 1}^{N}{\sum\limits_{m = 1}^{M}{{ɛ_{n\quad m}(j)}\frac{\Delta \quad X_{n}\Delta \quad Y_{m}}{\Lambda_{X}\Lambda_{Y}}}}}$ $\left\lbrack {ɛ_{Y}(j)} \right\rbrack^{- 1} = {\sum\limits_{n = 1}^{N}{\sum\limits_{m = 1}^{M}{\left\lbrack {ɛ_{n\quad m}(j)} \right\rbrack^{- 1}\frac{\Delta \quad X_{n}\Delta \quad Y_{m}}{\Lambda_{X}\Lambda_{Y}}}}}$

In the above equations, N is the number of stacks extending along the X-axis; M is the number of stacks extending along the Y-axis; ΔX_(n) is the width of the n-th stack along the X-axis; ΔY_(m) is the width of the m-th stack along the Y-axis; ε_(nm)(j) is the permittivity of a cell in the j-th layer, the cell belonging to the n-th stack along the X-axis and the m-th stack along the Y-axis; Λ_(X) and Λ_(Y) are the pitches along the X-axis and Y-axis, respectively.

The structure pitches Λ_(X) and Λ_(Y) are determined as follows: $\Lambda_{X} = {\sum\limits_{n = 1}^{N}{\Delta \quad X_{n}}}$ $\Lambda_{Y} = {\sum\limits_{m = 1}^{M}{\Delta \quad Y_{m}}}$

For the specific example of FIGS. 13A-13B, we have: N=2; M=2; ΔX₁=0.3 nm; ΔX₂=0.2 nm; ΔY₁=0.1 nm; ΔY₂=0.1 nm; Λ_(X)=ΔX₁+ΔX₂=0.5 nm; Λ_(Y)=ΔY₁+ΔY₂=0.2 nm.
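The area weights and the two layer averages for the two-dimensional case can be sketched as below, using the widths of the specific example. The function names and the `lines_parallel_to` argument are hypothetical, and the cell permittivities in the usage note are placeholder numbers, not material data.

```python
def area_weights(dx, dy):
    """w[n][m] = dX_n * dY_m / (Lambda_X * Lambda_Y); the weights sum to 1."""
    pitch_x, pitch_y = sum(dx), sum(dy)
    return [[x * y / (pitch_x * pitch_y) for y in dy] for x in dx]


def layer_permittivities(eps_nm, dx, dy, lines_parallel_to="Y"):
    """Return (eps_X, eps_Y) for one layer with cell permittivities eps_nm[n][m]."""
    w = area_weights(dx, dy)
    cells = [(n, m) for n in range(len(dx)) for m in range(len(dy))]
    arithmetic = sum(eps_nm[n][m] * w[n][m] for n, m in cells)
    harmonic = 1.0 / sum(w[n][m] / eps_nm[n][m] for n, m in cells)
    if lines_parallel_to == "Y":
        return harmonic, arithmetic  # field along X crosses the lines
    return arithmetic, harmonic      # lines parallel to X: roles exchanged
```

With ΔX = (0.3, 0.2) and ΔY = (0.1, 0.1), the weights are 0.3, 0.3, 0.2 and 0.2. For placeholder permittivities of 2 in the first X-stack and 4 in the second, lines parallel to the Y-axis give ε_X = 2.5 and ε_Y = 2.8.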

The total reflectivity R_(TOT) has the following form: $R_{TOT} = {{\psi \left| {\sum\limits_{n = 1}^{N}{\sum\limits_{m = 1}^{M}{{R_{X}\left( {0,n,m} \right)}\frac{\Delta \quad X_{n}\Delta \quad Y_{m}}{\Lambda_{X}\Lambda_{Y}}}}} \right|^{2}} + {\left( {1 - \psi} \right)\left| {\sum\limits_{n = 1}^{N}{\sum\limits_{m = 1}^{M}{{R_{Y}\left( {0,n,m} \right)}\frac{\Delta \quad X_{n}\Delta \quad Y_{m}}{\Lambda_{X}\Lambda_{Y}}}}} \right|^{2}}}$

wherein ψ describes the polarization of light, the condition ψ=0 corresponding to light polarized along Y-axis, the condition ψ=1 corresponding to light polarized along the X-axis, the condition ψ=0.5 corresponding to unpolarized light.

The reflectivity amplitudes R_(X)(j,n,m) and R_(Y)(j,n,m) from different two-dimensional stacks (n,m) can be calculated using the above equations, but with the permittivities ε_(X)(j,n,m) and ε_(Y)(j,n,m), wherein index j describes the layer, indices n and m describe those stacks along the X- and Y-axes to which the (n,m)-substack belongs. The permittivities ε_(X)(j,n,m) and ε_(Y)(j,n,m) are calculated as follows:

ε_(X)(j,n,m)=ε_(X)(j)+α(λ)[ε_(nm)(j)−ε_(X)(j)]

ε_(Y)(j,n,m)=ε_(Y)(j)+α(λ)[ε_(nm)(j)−ε_(Y)(j)]

Thus, by taking into account the relation between the wavelength of incident light and the structure pitch, the most suitable optical model can be selected and applied to carry out measurements of the structure parameters.

Those skilled in the art will readily appreciate that many modifications and changes may be applied to the invention as hereinbefore exemplified without departing from its scope defined in and by the appended claims. For example, the patterned structure may comprise any number of cells, each cell being formed of any number of stacks. In the method claims that follow, characters, which are used to designate claim steps, are provided for convenience only and do not imply any particular order of performing the steps.

What is claimed is:
 1. A method for measuring at least one desired parameter of a patterned structure which represents a grid having at least one cycle formed of at least two locally adjacent elements having different optical properties in respect of incident radiation, the structure having a plurality of features defined by a certain process of its manufacturing, the method comprising the steps of: a) providing an optical model, which is based on at least some of said features of the structure and on relation between wavelength range of the incident radiation to be used for measurements and pitch of the structure under measurements, and is capable of determining theoretical data representative of photometric intensities of light components of different wavelengths specularly reflected from the structure and of calculating said at least one desired parameter of the structure; b) locating a measurement area for applying thereto spectrophotometric measurements, wherein said measurement area is a grid cycles containing area and is substantially larger than a surface area of the structure defined by one grid cycle; c) applying the spectrophotometric measurements to said measurement area by illuminating it with incident radiation of a preset substantially wide wavelength range, detecting light component substantially specularly reflected from the measurement area, and obtaining measured data representative of photometric intensities of each wavelength within said wavelength range; d) analyzing the measured data and the theoretical data and optimizing said optical model until said theoretical data satisfies a predetermined condition; and e) upon detecting that the predetermined condition is satisfied, calculating said at least one parameter of the structure.
 2. The method according to claim 1, wherein said at least some features of the structure on which the optical model is based are available prior to measurements.
 3. The method according to claim 1, wherein said at least some features of the structure on which the optical model is based comprises nominal values of said desired parameters to be measured.
 4. The method according to claim 1, wherein said at least some features of the structure on which the optical model is based comprises materials forming each of said at least two elements.
 5. The method according to claim 1, wherein the step of providing the optical model comprises the step of: estimating known optical effects, that may be produced in the structure in response to the incident radiation, and contribution of said optical effects into the detected light component.
 6. The method according to claim 1, wherein said analyzing comprises the step of: comparing the theoretical data with the measured data and providing data indicative of the relationship between the measured and theoretical data.
 7. The method according to claim 1, wherein said optimizing comprises the steps of: adjusting certain variable factors of the optical model until the theoretical data satisfies the predetermined condition and obtaining correct values of the optical model factors.
 8. The method according to claim 7, wherein said certain variable factors of the optical model define contributions of known optical effects into the detected light component.
 9. The method according to claim 1, wherein said predetermined condition represents a merit function defining a certain value of a goodness of fit between the measured and theoretical data.
 10. The method according to claim 1, wherein said optimizing comprises the step of: varying a value of said at least one desired parameter until the theoretical data satisfies the predetermined condition.
 11. The method according to claim 1, wherein the measurement area is a part of the structure to be measured.
 12. The method according to claim 1, wherein the measurement area is located on a test site representing a test pattern similar to that of the structure, the test pattern having the same design rules and layer stacks.
 13. The method according to claim 1, wherein said structure is composed of n locally adjacent elements with j layers, and said photometric intensities obtained with the theoretical data are functions of effective permittivity ε of the structure.
 14. The method according to claim 13, wherein said theoretical data is determined according to the following equation: R_(TOT)=ψ|R_(X)(0)|²+(1−ψ)|R_(Y)(0)|², wherein ψ describes the polarization of light: ψ=0 for light polarized along the Y-axis, ψ=1 for light polarized along the X-axis, ψ=0.5 for unpolarized light; R_(X)(0) and R_(Y)(0) are reflectivity amplitudes of the entire structure along the X- and Y-axis, respectively.
 15. The method according to claim 14, wherein the effective permittivity is described by the following tensor: $\varepsilon = \begin{pmatrix} \varepsilon_{X} & 0 \\ 0 & \varepsilon_{Y} \end{pmatrix}$

wherein tensor components ε_(X)(j) and ε_(Y)(j) correspond to electric field vector parallel to the X-axis and Y-axis, respectively, and are as follows: $\left[ \varepsilon_{X}(j) \right]^{-1} = \sum_{n=1}^{N} \left[ \varepsilon_{n}(j) \right]^{-1} \frac{\Delta X_{n}}{\Lambda_{X}}$ and $\varepsilon_{Y}(j) = \sum_{n=1}^{N} \varepsilon_{n}(j) \frac{\Delta X_{n}}{\Lambda_{X}}$

wherein ε_(n)(j) is the permittivity of j-th layer in n-th stack; ΔX_(n) is the width of the n-th stack; N is the number of stacks.
 16. The method according to claim 15, wherein said reflectivity amplitudes are determined by recurrent equations utilizing said tensor components.
 17. The method according to claim 13, wherein the effective permittivity is described by the following tensor: $\varepsilon(j,n) = \begin{pmatrix} \varepsilon_{X}(j,n) & 0 \\ 0 & \varepsilon_{Y}(j,n) \end{pmatrix}$

wherein tensor components ε_(X)(j,n) and ε_(Y)(j,n) are as follows: ε_(X)(j,n)=ε_(X)(j)+α(λ)[ε_(n)(j)−ε_(X)(j)] and ε_(Y)(j,n)=ε_(Y)(j)+α(λ)[ε_(n)(j)−ε_(Y)(j)], wherein α(λ) is a coefficient representing a monotonically decreasing function of the wavelength λ of incident radiation; α=0 when the structure pitch Λ is smaller than the wavelength of incident light λ, α=1 when the structure pitch Λ is larger than the wavelength of incident light λ, and 0<α<1 when Λ∼λ.
 18. The method according to claim 17, wherein said theoretical data is determined according to the following equation: $R_{TOT} = \psi \left| \sum_{n=1}^{N} R_{X}(0,n) \frac{\Delta X_{n}}{\Lambda_{X}} \right|^{2} + (1-\psi) \left| \sum_{n=1}^{N} R_{Y}(0,n) \frac{\Delta X_{n}}{\Lambda_{X}} \right|^{2}$

wherein ψ describes the polarization of light: ψ=0 for light polarized along the Y-axis, ψ=1 for light polarized along the X-axis, ψ=0.5 for unpolarized light; N is the number of stacks; R_(X)(0,n) and R_(Y)(0,n) are the reflectivity amplitudes for the entire n-th stack.
 19. The method according to claim 18, wherein said reflectivity amplitudes are determined by recurrent equations utilizing said tensor components.
 20. The method according to claim 13, wherein said at least one cycle is two-dimensional formed by said n different locally adjacent elements aligned along X-axis, and m different locally adjacent elements aligned along Y-axis.
 21. The method according to claim 20, wherein said theoretical data is determined by the following equation: $R_{TOT} = \psi \left| \sum_{n=1}^{N} \sum_{m=1}^{M} R_{X}(0,n,m) \frac{\Delta X_{n} \Delta Y_{m}}{\Lambda_{X} \Lambda_{Y}} \right|^{2} + (1-\psi) \left| \sum_{n=1}^{N} \sum_{m=1}^{M} R_{Y}(0,n,m) \frac{\Delta X_{n} \Delta Y_{m}}{\Lambda_{X} \Lambda_{Y}} \right|^{2}$

wherein ψ describes the polarization of light, the condition ψ=0 corresponding to light polarized along the Y-axis, the condition ψ=1 corresponding to light polarized along the X-axis, the condition ψ=0.5 corresponding to unpolarized light; R_(X)(0,n,m) and R_(Y)(0,n,m) are reflectivity amplitudes from all different two-dimensional elements of the structure.
 22. The method according to claim 21, wherein said reflectivity amplitudes are determined by recurrent equations utilizing components of the effective permittivity tensor ε_(X)(j,n,m) and ε_(Y)(j,n,m), which are as follows: ε_(X)(j,n,m)=ε_(X)(j)+α(λ)[ε_(nm)(j)−ε_(X)(j)] and ε_(Y)(j,n,m)=ε_(Y)(j)+α(λ)[ε_(nm)(j)−ε_(Y)(j)].
 23. The method according to claim 1, wherein said at least one desired parameter to be measured is a width of at least one of said at least two locally adjacent elements in the grid cycle.
 24. The method according to claim 1, wherein said at least one desired parameter to be measured is a depth of at least one layer of said at least one stack.
 25. The method according to claim 1, wherein said at least one desired parameter to be measured is a depth of a metal-loss portion resulting from a chemical mechanical polishing applied to said structure.
 26. The method according to claim 1, wherein said patterned structure is a semiconductor wafer.
 27. The method according to claim 1, wherein said manufacturing process is Chemical Mechanical Planarization (CMP).
 28. An apparatus for measuring at least one desired parameter of a patterned structure that represents a grid having at least one grid cycle formed of at least two locally adjacent elements having different optical properties in respect of an incident radiation, the structure having a plurality of features defined by a certain process of its manufacturing, the apparatus comprising: a spectrophotometer illuminating a measurement area by an incident radiation of a preset substantially wide wavelength range and detecting a specular reflection light component of light reflected from the measurement area for providing measured data representative of photometric intensities of detected light within said wavelength range, wherein the measurement area is substantially larger than a surface area of the structure defined by the grid cycle; and a processor unit coupled to the spectrophotometer, the processor unit comprising a pattern recognition software and a translation means so as to be responsive to said measured data and locate measurements, the processor being operable for applying an optical model, based on at least some of said features of the structure and on relation between wavelength range of the incident radiation to be used for measurements and pitch of the structure under measurements, for providing theoretical data representative of photometric intensities of light specularly reflected from the structure within said wavelength range and calculating said at least one desired parameter, and comparing said measured and theoretical data and detecting whether the theoretical data satisfies a predetermined condition.
 29. The apparatus according to claim 28, wherein said spectrophotometer comprises a spectrophotometric detector and a variable aperture stop located in the optical path of light reaching the detector, the diameter of the aperture stop being variable in accordance with the grid cycle of the measured structure.
 30. The apparatus according to claim 28, wherein said measurement area is located within the patterned area of said structure.
 31. The apparatus according to claim 28, wherein said measurement area is located within a test site located outside the patterned area of said structure.
 32. The apparatus according to claim 28, wherein each of said at least two elements is a stack including layers having different optical properties.
 33. The apparatus according to claim 28, wherein said at least one desired parameter to be measured is a width of at least one of said at least two locally adjacent elements in the grid cycle.
 34. The apparatus according to claim 32, wherein said at least one desired parameter to be measured is a depth of at least one layer of said at least one stack.
 35. The apparatus according to claim 32, wherein said at least one desired parameter to be measured is a depth of a metal-loss portion resulting from a chemical mechanical polishing applied to said structure.
 36. A working station for processing wafers progressing on a production line, wherein each of said wafers is a patterned structure that represents a grid having at least one grid cycle formed of at least two locally adjacent elements having different optical properties in respect of an incident radiation, and the structure has a plurality of features defined by a certain process of its manufacturing, the working station comprising an inspection apparatus and a support frame for supporting the wafer within an inspection plane, wherein the inspection apparatus comprises: a spectrophotometer illuminating a measurement area by an incident radiation of a preset substantially wide wavelength range and detecting a specular reflection light component of light reflected from the measurement area for providing measured data representative of photometric intensities of detected light within said wavelength range, wherein the measurement area is substantially larger than a surface area of the structure defined by the grid cycle; and a processor unit coupled to the spectrophotometer, the processor unit comprising a pattern recognition software and a translation means so as to be responsive to said measured data and locate measurements, the processor being operable for applying an optical model, based on at least some of said features of the structure and on relation between wavelength range of the incident radiation to be used for measurements and pitch of the structure under measurements, for providing theoretical data representative of photometric intensities of light specularly reflected from the structure within said wavelength range and calculating said at least one desired parameter, and comparing said measured and theoretical data and detecting whether the theoretical data satisfies a predetermined condition.
 37. The working station according to claim 36, wherein said production line comprises a Chemical Mechanical Planarization (CMP) tools arrangement. 
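
By way of non-limiting illustration only, the relations recited in claims 15, 17 and 18 may be sketched in Python as shown below. The function names (effective_permittivity, blended_permittivity, total_reflectance) and the argument layout are hypothetical and are introduced here purely for illustration. The pitch Λ_X is passed in explicitly rather than derived from the stack widths, and the per-stack reflectivity amplitudes R_(X)(0,n) and R_(Y)(0,n) are taken as given inputs, since the recurrent equations that produce them are not reproduced in the claims.

```python
from typing import Sequence, Tuple


def effective_permittivity(
    eps: Sequence[complex], widths: Sequence[float], pitch: float
) -> Tuple[complex, complex]:
    """Mixing rule of claim 15 for a single layer j.

    eps[n] is the permittivity of layer j in the n-th stack, widths[n] is
    Delta X_n and pitch is Lambda_X (hypothetical argument layout).
    """
    # Electric field parallel to X: width-weighted harmonic mean.
    eps_x = 1.0 / sum((w / pitch) / e for e, w in zip(eps, widths))
    # Electric field parallel to Y: width-weighted arithmetic mean.
    eps_y = sum(e * (w / pitch) for e, w in zip(eps, widths))
    return eps_x, eps_y


def blended_permittivity(eps_eff: complex, eps_n: complex, alpha: float) -> complex:
    """Claim 17: eps(j, n) = eps_eff(j) + alpha * (eps_n(j) - eps_eff(j)).

    alpha = 0 returns the effective value, alpha = 1 the stack's own value.
    """
    return eps_eff + alpha * (eps_n - eps_eff)


def total_reflectance(
    r_x: Sequence[complex],
    r_y: Sequence[complex],
    widths: Sequence[float],
    pitch: float,
    psi: float = 0.5,
) -> float:
    """Claim 18: R_TOT from per-stack amplitudes R_X(0, n) and R_Y(0, n).

    The amplitudes are inputs here; computing them is outside this sketch.
    """
    sum_x = sum(r * (w / pitch) for r, w in zip(r_x, widths))
    sum_y = sum(r * (w / pitch) for r, w in zip(r_y, widths))
    return psi * abs(sum_x) ** 2 + (1.0 - psi) * abs(sum_y) ** 2


# Illustrative values only: two stacks of equal width filling a unit pitch.
eps_x, eps_y = effective_permittivity([2.0, 4.0], [0.5, 0.5], 1.0)
print(eps_x, eps_y)
print(total_reflectance([0.2, 0.4], [0.1, 0.3], [0.5, 0.5], 1.0, psi=0.5))
```

With the illustrative inputs above, the Y component is the arithmetic mean of the two permittivities (3.0) and the X component is their harmonic mean (about 2.667), which reflects the anisotropy expressed by the tensor of claim 15. The amplitudes are summed before the modulus is squared, so contributions of different stacks are combined coherently, as in the equation of claim 18.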